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ABSTRACT 


Due to high nonlinearity with features of large time constants, delays, and 
interaction among variables, control of the wastewater treatment plants 
(WW'TPs) is a very challenging task. Modern control strategies such as model 
predictive controllers or artificial neural networks can be used to deal with 
the non-linearity. Another characteristic of this system should be considered is 
that it works repetitively. Iterative learning control (ILC) is a potential 
candidate for such a demanding task. This paper proposes a method using ILC 
for WWTPs to achieve new results. By exploiting data from the previous 
iterations, the learning control algorithm can improve gradually tracking 
control performance for the next runs, and hence outperforms conventional 


(BSM) control approaches such as feedback controller and model predictive control 
(MPC). The benchmark simulation model No.1-BSM1 has been used as 
a standard for performance assessment and evaluation of the control strategy. 
Control of the Dissolved Oxygen in the aerated reactors has been performed 
using the PD-type ILC algorithms. The obtained results show the advantages 
of ILC over a classical PI control concerning the control quality indexes, IEA 
and ISE, of the system. Besides, the conventional feedback regulator is 
designed in a combination with the iterative learning control to deal with 
uncertainty. Simulation results demonstrate the potential benefits of 
the proposed method. 
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1. INTRODUCTION 

A wastewater treatment plant (WWTP) is an industrial system encompassing mechanical, physical, 
chemical and biological processes in order to remove pollutants from the inlet wastewater [1]. The complexity 
of the biological and biochemical processes and the strong fluctuations of the influent flow rate make 
the control problem of the WWTPs very challenging [2]. The technology using the activated sludge process 
(ASP) is applied widely in advanced WWTPs [3, 4]. This technology is inexpensive and can be adapted to 
different types of wastewater. The WWTPs using ASP is described in Figure 1 [5]. Municipal wastewater is 
treated in an aeration tank, then flows to a sedimentation tank, where the biomass sludge is recovered. Treated 
water, located at the top of the sedimentation tank, will be the output of the system. At the bottom of the tank, 
the sludge deposits, in which a small part is returned to the aeration tank, while the waste activated sludge is 
pushed out. 
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The schematic of the WWTP is shown in Figure 2 [6]. The biological reactor has five tanks, including 
two anoxic sections (pre-nitrification) and three aerated ones (nitrification) [7, 8]. The sludge is recycled from 
the clarifier into the anoxic tank (external recycle), and a part of the mixed liquor is also fed back to 
the biological reactor (internal recycle) to maintain the microbiological population. The sludge withdrawn is 
pumped continuously from the settler to keep the sludge concentration constant. In order to improve the ASP 
efficiency, there are three things we can do to control this process: the air, or aeration rate; the sludge wasting 
through waste activated sludge; and the sludge recirculation, either through return activated sludge and/or 
internal mixed liquor recycle for nutrient removal. Aeration is an important part of biological reactors because 
aerobic conditions will facilitate the development of a wide range of microorganisms. Aerobic conditions for 
the growth of biomass affect the control of DO in biological wastewater treatment by activated sludge method. 
Inadequate or excess oxygen in aerobic tanks will result in degradation of activated sludge. To maintain 
the desired aeration in the biological tank, a DO controller is implemented. Besides, the DO level in the fifth 
tank is controlled that manipulates the aeration coefficient for this basin K?,. Besides, an outer control loop is 
used to control the nitrate removal by manipulating the internal recycle flow-rate. 
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Figure 1. Municipal wastewater treatment by activated sludge process 
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Figure 2. schematic representation of the WWTPs 


To keep favorable processing conditions for the required treatment results and cost-effectiveness, 
different control strategies are applied. For example, in the aeration section, the aeration is controlled based on 
the difference between DO desired value and the measured output. Besides, control algorithms are designed to 
adjust the position of the aeration valve and the operation of the air compressor. Practically, conventional PID 
is the most used controller in wastewater treatment [9], because of its simplicity and capability of providing 
acceptable performance. Its disadvantage, however, is that correction is active only when the measured output 
differs from the setpoint. A combination of feedforward and feedback regulator has been proposed for DO 
concentration control [10, 11]. This method can provide a more stable, responsive, and reliable control system. 
The basic idea of feedforward control is based on the notion that major disturbances are measured and 
compensated before they have time to upset the system. In WWTPs, however, it is very difficult to measure 
disturbances as well as to derive a mathematical model. 

To increase the efficiency and reduce costs, a number of advanced controllers have been proposed 
such as cascade control loops [12], intelligent controllers based on Hedge algebras [13], fuzzy logic 
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control [14, 15], Model predictive control [16, 17]. These aforementioned methods, however, are non-learning 
algorithms. Artificial neural networks are proposed in [18]. Neural network learning, however, modifies 
the controller parameters. Therefore, modifying large networks of nonlinear neurons requires extensive training 
data. Whereas, ILC is another type of learning strategy that adjusts the control input, which is a signal, by 
gathering data from previous executions [19]. WWTPs work in a repetitive manner. Therefore, if the same 
manipulated variables are used for every iteration, the control performance cannot be improved. Therefore, 
data from previous runs is wasted. The main idea of ILC is that the performance of a system that operates in 
repetitive/repeatable maneuvers can be enhanced by incorporating and learning information from previous 
iterations. To our best knowledge, ILC has not been deployed in such systems. This paper aims to propose 
a method using ILC for WWTPs to achieve new results in DO control performance. Compared to feedback, 
feedforward controllers, ILC has some advantages [20]. A feedback regulator has to react to inputs and 
disturbance, results in a transient tracking lag. To eliminate this lag, a feedforward controller can be used, but 
only for measurable or known signals. ILC is anticipatory and able to compensate for exogenous signals, such 
as repetitive disturbances by learning from previous operations. In ILC, the exogenous signals are not required 
to be known or measured but repeated from iteration to iteration. Due to ILC cannot address unanticipated and 
nonrepeating disturbances, a combination of feedback regulator and with ILC 1s proposed to use. The remainder 
of this paper is organized as follows. The proposed control algorithm is introduced in section 2. Then, 
a combined ILC and PI regulator for WWTPs is derived in section 3. Next, section 4 presents simulation results 
and discussion. Finally, the conclusion is given in Section 5. 


2. THE PROPOSED CONTROL ALGORITHM 

The main idea of ILC is to utilize the situation that the control system carries out the same task over 
and over again. Based on that, the performance of the control system could be improved gradually by using 
the results from the previous iterations when updating the input signal for the next iteration. Figure 3 shows 
the basic structure of an ILC. The input signal U, (s) and the error signal E;(s) between the reference trajectory 
Y,(s) and system output Y; (s) are stored in memory. The input signal for the next iteration is computed based 
on U;(s) and E;(s) to improve the system performance. That is: U;4,(s) = f (U; (s), E;(s)). A popular ILC 
algorithm is [21]: 


Uj+1(S) = U;(s) T Le(s)E; (s) (1) 


where L,.(s) is a learning function. 

It should be mentioned that ILC is an open-loop approach and has no feedback mechanism to deal 
with nonrepeating and unanticipated disturbances. Therefore, we proposed to use a feedback regulator 
combined with ILC for the DO control in tank 5, as shown in Figure 4. The basic idea here is that the system 
performs the same movement repeatedly, and a correction signal AU; is updated after each iteration. 
In Figure 4, C is a feedback PI regulator, and P is the plant. 
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Figure 3. Block diagram of basic iterative Figure 4. ILC combined with a feedback regulator 
learning control 
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From the block diagram, the input signal is given by: 
U,(s) = G.(s).(Ya(s) — ¥(s)) + AU;(s) (2) 


in which G,(s) is a feedback transfer function. So, the output of the closed-loop system: 


VCs) = o (Gels). Yas) + AY;(s)) 3) 


where Gy (s) is the plant transfer function. The update equation: 

AU; (s) = U;(s) + Le(s). E;(s) (4) 
in which the learning function is PD-typed as following : L,(s) = K, + Kgs 

Convergence is a major issue in ILC. Convergence means that the iterative update of the input signal 
converges to a signal giving a good performance. In the following, conditions for convergence to zero error 
will be derived. From Figure 4, the error signal of the iterative j 1s defined: 


E; (s) = Ya(s) — Y; (s) (5) 


inserting (3) into (5) gives: 
E;(s) = G(s). G(s). (G7 (s). Ya (s) — AU; (s) ) (6) 


where: 


G-(S).Gp(s) 
G(s) = ee 


 14Ge(s).Gp(s) we 

is the closed-loop transfer function. At the iterative j+1, the control signal 1s: 

Uj+i(s) = U; (s) + Le(s). Ej(s) (8) 
similar to (6), we have: 

Eja1(s) = G(s). G7 (8). (Gy (8). Ya (s) — AUj+1 (s) (9) 
inserting (8) into (9) gives: 

Ej41(s) = Ej(s) — G(s). G,*(s). Le(s). E; (s) (10) 
hence: 

Eye (s) = (1 = G(s). G7 (s). Le(s)) Bi(s) (11) 


we see that with |1 — G(s). G1 (s). L,(s)| < 1 Y æ, the error will tend to zero, and hence to output signal 
will follow the reference exactly. The condition in (11) means that the Nyquist diagram G(jw)G,'(jw)L(Gjw) 
has to be inside a circle of radius one with the center at one. This circle is denoted learning circle. 


3. COMBINED ILC WITH A PI REGULATOR FOR WWTPs 

In this project, the proposed method is applied to the BSM1, which is based on the most popular 
Activated Sludge Model No.1 (ASM1) developed by the International Association on Water Pollution 
Research and Control. The BSM1 is the standard model [7] to model, assess performance, and evaluate control 
strategies [8]. 
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Figure 5 shows the Simulink model, including a nitrate controller and a DO controller. In former, a PI 


controller is pre-implemented. In the latter, another pre-designed PI controller is applied in the first iteration, 

then ILC combined with a PI regulator will be used in next iterations. In the first iteration, AU; (s) = 0, so only 
the PI feedback regulator is active. The control signal U_last and the measured output O2_meas are sent to 
the workspace in Matlab. In Matlab programming, the updating control signal is designed and calculated 
according to (4), then sent back to the Simulink model using input terminals Delta_u to run the next iteration. 
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Figure 5. Simulation model 


To evaluate the performance of the system, control quality evaluation will be used. In ILC, 
the converged error E,,(s) and the initial error Eo (s) is compared, using the differential integration method 


(IAE) and integrating the square of control deviation [7], in which: 
(12) 


t t 
IAE; = f |E;|dt and ISE; = J,’ E} dt 


to evaluate the performance of the system, operating cost index (OCI) and output wastewater quality index 


(EQI) are used, according to [8]: 
= 1 t=14 days 
EQI =—— ice day 4-Qe(t)dt (13) 





where: 
A = Brss. TSS, (t) + Bcop.COD,(t) + Bren: TKN,(t) + Bno: Sno, (t) + Bgops. BOD; (t) 


In which TSS, COD, TKN,, S NOe> and BOD, are the total amount of solids, the demand of 
the chemical oxygen, the total amount of nitrogen, the demand of the nitrate concentration and biological 
oxygen in the outlet, respectively. Q,(t)is the flow rate of outlet, and T is time (14 days in simulation) 


Brss = 2, Bcop = 1, Brgy = 30, Byo = 10, and Bgops = 2 are coefficients. 
(14) 


OCI = AE + PE +5.SP+3.EC + ME 
where AE is aeration energy, EC is the mixing energy, EC is the consumption of external carbon 
source, PE is the pump energy and SP is the total amount of discharged sludge [7]. 
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4. RESULTS AND ANALYSIS 

Figure 6 demonstrates the simulation result of the DO control responding to a constant setpoint, in 
dry weather, with a PI controller, and with the proposed ILC controller. The Figure show that while a PI 
controller still gives a large tracking error, up to about 2.5 [g.(COD)/m3], the ILC combined PI gives a much 
better tracking control in the second iteration, and to nearly zero error after 10 iterations. This leads to very 
small control quality indexes 
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Simulation result of the learning process is given in Figure 7, with 3 different sets of learning 
coefficients: set 1: Kp ,,,=5,Kqa,, = 2; set 2: Kp ae = 0-9, Ka ac = 0.2; and set 3: K, ae = 40, Ka ac = 10. 
The figure illustrates that both JAE and ISE decrease significantly after each iteration. The rate of decreasing 
can be influenced by Kp ic and Kg ic of the learning function L,(s). 

An importance aspect needs to be considered when applying ILC is the convergence. That 1s, 
the iterative update of the control signal converges to a signal giving a good performance. In [22], a detail of 
convergence issues was discussed, and some convergence criteria were derived. In this project, the learning 
process can be continued until the desired performance is reached. 

Different weather conditions are also used to investigate the advantage of the proposed controller 
concerning disturbance rejection. Table 1 gives a comparison of the proposed controller with a Pl-only 
regulator and with the method using a model predictive controller combined a feedforward [2] in term of 
control quality indexes. The Table shows that the proposed controller gives the best control performance with 
the smallest JAF and ISE indexes in all three considered weather conditions. 


Table 1. Comparison of quality indexes of different controllers in dry weather 
Indexes Dry weather Rainy weather Stormy weather 
Method IAE ISE IAE ISE IAE ISE 


PI 0.25105 0.022085 0.2299 0.0177 0.2601 0.0217 
MPC+FF [2] 0.047 0.00067 - 0.0013 - 0.0018 
PI+ILC 0.0098 0.000078 0.0082 0.000054 0.0081 0.000058 


To improve the system performance, a hierarchical control is proposed by some 
investigations [2, 4]. Higher-level control is designed to regulate the DO set-points, using states of the process, 
usually Syz4 and Syo concentration values in any tank or the inlet [23, 24] or DO in other basins [25, 26]. 
The simulation of the proposed control algorithm for a regulated setpoint is given in Figure 8. Similar to 
a constant setpoint, the tracking control performance with a regulated setpoint is also nearly zero error. 
Table 2 compares the system quality indexes (OCI and EQI) of the DO control using constant setpoint and 
regulated setpoint. The table illustrates that using regulated setpoint, not only OCI but also EQI are decreased 
significantly comparing those using the constant setpoint in all three considered weather conditions. 
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Figure 8. Control performance with regulated setpoint 


Table 2. Comparison of quality indexes of different controllers in all three weather 


Method ILC+PI 

Indexes Constant setpoint Regulated setpoint % 
Dry OCI 16364.77 16298.87 - 0.4027 % 
weather EQI 6095.23 6002.08 - 1.5282 % 
Rainy OCI 15996.38 15916.48 - 0.4995 % 
weather EQI 8174.91 8109.56 - 0.7994 % 
Stormy OCI 17236.97 17188.39 - 0.2818 % 
weather EQI 1228.12 7124.08 - 1.4476 % 


5. CONCLUSION 

The presented ILC performance of the nitrate concentration in the DO concentration control in 
the last aerated reactor of the pre-denitrification WWTP has shown promising results compared to 
the traditional decentralized PI control as well as to the MPC combined with feedforward controller. 
The combination of ILC with a PI feedback regulator provides a significant improvement of the WWTP 
Operation aimed at organic and ammonium pollutants removal proved in the presence of the weather 
disturbance. The proposed control algorithm is proven for both constant setpoint and regulated setpoint. 

The proposed controller in this project has just applied at tank No. 5. For further reduction of 
the OCI and the EQI, similar control structures can be designed to control oxygen concentration for the other 
basins (tank 3 and tank 4), and the nitrate control. Furthermore, the proposed PD-typed ILC 1s not an optimal 
algorithm. Optimal ILC, robustness, and implementation of ILC in the multiple-input multiple-output (MIMO) 
setup are our ongoing works for controlling a larger number of the WWTP variables. As an optimal ILC 
may successfully work in the presence of constraints, for both manipulated and controlled variables, 
the proposed control design outperforms the traditional control approach and reveals incentives for 
its practical implementation. 
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